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1 SALSA: the stochastic approach for link-structure analysis 99% 
@ R. Lempel , S. Moran 

ACM Transactions on Information Systems (TOIS) April 2001 

Volume 19 Issue 2 

Today, when searching for information on the WWW, one 
usually performs a query through a term-based search engine. 
These engines return, as the query's result, a list of Web pages 
whose contents matches the query. For broad-topic queries, 
such searches often result in a huge set of retrieved documents, 
many of which are irrelevant to the user. However, much 
information is contained in the link-structure of the WWW. 
Information such as which pages are linked to others can be 
used to augment searc ... 

2 Authoritative sources in a hyperlinked environment 98% 
EH Jon M. Kieinberg 

Journal of the ACM (JACM) September 1999 

Volume 46 Issue 5 

The network structure of a hyperlinked environment can be a 
rich source of information about the content of the environment, 
provided we have effective means for understanding it. We 
develop a set of algorithmic tools for extracting information 
from the link structures of such environments, and report on 
experiments that demonstrate their effectiveness in a variety of 
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context on the World Wide Web. The central issue we address 
within our framework is the distillation of broad search topics, 



3 Finding authorities and hubs from link structures on the World 98% 
0 Wide Web 

Allan Borodin , Gareth 0. Roberts , Jeffrey S. Rosenthal , 
Panayiotis Tsaparas 

Proceedings of the tenth international conference on World Wide 
Web April 2001 

4 Constructing good quality web page communities 96% 
0) Jingyu Hou , Yanchun Zhang 

Australian Computer Science Communications , Proceedings of the 

thirteenth Australasian conference on Database technologies - 

Volume 5 January 2002 

Volume 24 Issue 2 

World Wide Web is a rich source of information and continues to 
expand in size and complexity. To capture the features of the 
Web at a higher level to realise the information classification 
and efficient retrieval on the Web is becoming a challenge task. 
One natural way is to exploit the linkage information among the 
Web pages. Previous work such as HITS in this area is based on 
a set of retrieved pages to get a Web community that is a bunch 
of pages related to the query topics. Since the set of ... 

5 PicASHOW: pictorial authority search by hyperlinks on the Web 95% 
@i Ronny Lempel , Aya Soffer 

Proceedings of the tenth international conference on World Wide 
Web April 2001 

6 PicASHOW: pictorial authority search by hyperlinks on the web 94% 
2j ACM Transactions on Information Systems (TOIS) January 2002 

Volume 20 Issue 1 

We describe PicASHOW, a fully automated WWW image retrieval 
system that is based on several link-structure analyzing 
algorithms. Our basic premise is that a page p displays (or links 
to) an image when the author of p considers the image to be of 
value to the viewers of the page. We thus extend some well 
known link-based WWW page retrieval schemes to the context 
of image retrieval. PicASHOW's analysis of the link structure 
enables it to retrieve relevant images even when those ... 

7 Approximation algorithms for the metric labeling problem via a 93% 
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13 new linear programming formulation 

Chandra Chekuri , Sanjeev Khanna , Joseph Naor , Leonid Zosin 
Proceedings of the twelfth annual ACM-SIAM symposium on 
Discrete algorithms January 2001 

We consider approximation algorithms for the 
metric labeling problem. This problem was 
introduced in a recent paper by Kleinberg and 
Tardos [20], and captures many classification 
problems that arise in computer vision and related 
fields. They gave an &Ogr; (log k log log k) 
approximation for the general case where k is the 
number of labels and a 2-approximation for the 
uniform metric case. More recently, Gupta and 
Tardos [15] gave a 4-approximation for the 
truncated ... 

8 Searching the Web 92% 
3) ACM Transactions on Internet Technology (TOIT) August 2001 

Volume 1 Issue 1 

We offer an overview of current Web search engine design. After 
introducing a generic search engine architecture, we examine 
each engine component in turn. We cover crawling, local Web 
page storage, indexing, and the use of link analysis for boosting 
search performance. The most common design and 
implementation techniques for each of these components are 
presented. For this presentation we draw from the literature and 
from our own experimental search engine testbed. Emphasis is 
on introduci ... 

9 Improved algorithms for topic distillation in a hyperlinked 92% 
03 environment 

Krishna Bharat , Monika R. Henzinger 

Proceedings of the 21st annual international ACM SIGIR conference 
on Research and development in information retrieval August 1998 

10 Hubs, authorities, and communities 90% 
(3 Jon M. Kleinberg 

ACM Computing Surveys (CSUR) December 1999 

11 Stable algorithms for link analysis 89% 
0 Andrew Y. Ng , Alice X. Zheng , Michael I. Jordan 
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Proceedings of the 24th annual international ACM SIGIR conference 
on Research and development in information retrieval September 
2001 

The Kleinberg HITS and the Google PageRank algorithms are 
eigenvector methods for identifying " x authoritative" or 
' x influential" articles, given hyperlink or citation information. 
That such algorithms should give reliable or consistent answers 
is surely a desideratum, and in~\cite{ijcaiPaper}, we analyzed 
when they can be expected to give stable rankings under small 
perturbations to the linkage patterns. In this paper, we extend 
the analysis and show how it gives insight into ways of de ... 

12 Constructing, organizing, and visualizing collections of topically 89% 
@) related Web resources 

Loren Terveen , Will Hill , Brian Amento 

ACM Transactions on Computer-Human Interaction (TOCHI) March 
1999 

Volume 6 Issue 1 

For many purposes, the Web page is too small a unit of 
interaction and analysis. Web sites are structured multimedia 
documents consisting of many pages, and users often are 
interested in obtaining and evaluating entire collections of 
topically related sites. Once such a collection is obtained, users 
face the challenge of exploring, comprehending and organizing 
the items. We report four innovations that address these user 
needs: (1) we replaced the Web page with the Web site 

13 On the design of a learning crawler for topical resource 88% 

13 discovery 

Charu C. Aggarwal , Fatima Al-Garawi , Philip S. Yu 

ACM Transactions on Information Systems (TOIS) July 2001 

Volume 19 Issue 3 

In recent years, the World Wide Web has shown enormous 
growth in size. Vast repositories of information are available on 
practically every possible topic. In such cases, it is valuable to 
perform topical resource discovery effectively. Consequently, 
several new ideas have been proposed in recent years; among 
them a key technique is focused crawling which is able to crawl 
particular topical portions of the World Wide Web quickly, 
without having to explore all web pages. In this paper, we 
propose ... 

14 Session 7: Fault-tolerant routing in peer-to-peer systems 88% 
@3 James Aspnes , Zoe Diamadi , Gauri Shah 

Proceedings of the twenty-first annual symposium on Principles of 
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distributed computing July 2002 

We consider the problem of designing an overlay network and 
routing mechanism that permits finding resources efficiently in 
a peer-to-peer system. We argue that many existing approaches 
to this problem can be modeled as the construction of a random 
graph embedded in a metric space whose points represent 
resource identifiers, where the probability of a connection 
between two nodes depends only on the distance between them 
in the metric space. We study the performance of a 
peer-to-peer system wher ... 

15 Simple on-line algorithms for the maximum disjoint paths 88% 
EH problem 

Petr Kolman , Christian Scheideler 

Proceedings of the thirteenth annual ACM symposium on Parallel 
algorithms and architectures July 2001 

In this paper we study the problem of finding 
disjoint paths in graphs. Whereas for specific graphs 
many (almost) matching upper and lower bounds 
are known for the competitiveness of on-line path 
selection algorithms, much less is known about how 
well on-line algorithms can perform in the general 
setting. In several papers the expansion has been 
used to measure the performance of off-line and 
on-line algorithms in this field. We study a class of 
simple deterministic on-line algorithms and sho ... 

16 Integrating content search with structure analysis for 88% 
S) hypermedia retrieval and management 

Wen-Syan Li , K. Selcuk Candan 

ACM Computing Surveys (CSUR) December 1999 

17 Link-based and content-based evidential information in a belief 88% 
13 network model 

Ilmerio Silva , Berthier Ribeiro-Neto , Pavel Calado , Edleno Moura 
, Nfvio Ziviani 

Proceedings of the 23rd annual international ACM SIGIR conference 
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